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ABSTRACT 



The "ALFA Ultra Deep Survey" (AUDS) is an ongoing 21-cm spectral survey 
with the Arecibo 305m telescope. AUDS will be the most sensitive blind survey 
undertaken with Arecibo's 300 MHz Mock spectrometer. The survey searches for 
21-cm Hi line emission at redshifts between and 0.16. The main goals of the 
survey are to investigate the Hi content and probe the evolution of Hi gas within 
that redshift region. In this paper, we report on a set of precursor observations 
with a total integration time of 53 hours. The survey detected a total of eighteen 
21-cm emission lines at redshifts between 0.07 and 0.15 in a region centered 
around a^ooo ~ h , 5 « 15°42 / . The rate of detection is consistent with the one 
expected from the local Hi mass function. The derived relative Hi density at the 
median redshift of the survey is pm[z — 0.125] = (1.0 ± 0.3)po, where p is the 
Hi density at zero redshift. 

Subject headings: cosmology: observations - galaxies: evolution - radio lines: 
galaxies 



Introduction 



1.1. Neutral hydrogen surveys of the Local Universe 



Large- area blind Hi surveys have come of age since the first 21-cm multibeam receiver 
was installed at Parkes Observatory in 1997, to be followed by similarly powerful receiving 
systems at the Lovell telescope, the Arecibo telescope and the Effelsberg telescope. These 
surveys have allowed extensive areas of sky to be mapped for extragalactic neutral hydrogen, 
resulting in the detection of over 10 4 galaxies ( [Meyer et al]|2004t |Wong et al||2006t |Giovanelli 



et al. 


2005; 


Winkel et al. 


2008) 



expanded the sky coverage even further (e.g. Rivers et al. 1999; Henning et al. 2000, 2010). 
Prior to this, the most extensive neutral hydrogen surveys were those carried out with the 



single-beam or dual-beam systems at Arecibo: the Arecibo Hi Strip Survey (iZwaan et al. 



1997[ AHISS) the Arecibo Slice (Spitzak & Schneider 1998) and the Arecibo Dual Beam 



1 The Arecibo Observatory is part of the National Astronomy and Ionosphere Center which is operated 
by Cornell University under a Cooperative Agreement with the National Science Foundation. 
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Survey ( |Rosenberg fe Schneider||2000| ADBS). These were all limited by the available band- 
pass to maximum recessional velocities of around 8000 kms -1 (z~ 0.03) and had relatively 
slow mapping speeds compared to the more recent multibeam surveys. 

Multibeam surveys have adopted the strategy of going wide rather than deep, since 
this generally results in the largest observed volume and the greatest number of detected 
objects. If a survey in the local Universe is carried out with an integration time of 1 unit, 
then extending it to t units by making it wider will increase the volume covered by a factor of 
i, while extending it with a longer integration in the same area will increase the volume by a 
factor of i 3 / 4 , if not limited by receiver bandwidth and the changing beamsize with frequency 
is neglected. Going very deep is generally only advantageous if it becomes possible to reach 
a new population of galaxies, e.g. outside the local region, or galaxies at an earlier epoch in 
their evolution. Therefore, most survey time at Parkes and Lovell was spent on the relatively 



shallow Hi Parkes All Sky Survey ( [Meyer et al.][2004t HIPASS) and the HI Jodrell All Sky 
Survey ( Lang et al.|2003[ HIJASS) respectively. These surveys were limited by sensitivity (as 
well as the available bandpass) to maximum recessional velocities of less than 14000 kms -1 
(z~ 0.04), even for the most massive systems. The deepest field survey carried out with the 



Parkes system (Minchin et al. |2003 , HIDEEP) reached a noise level of only 3.2 mJy (at a 
velocity resolution of 18 kins" 1 ), considerably higher than AHISS (0.75 mJy at 18 kms -1 ). 
The most sensitive Hi surveys in the local universe are those pointed towards the CVn group 



(Kovac et al. 2009) and the Virgo cluster (Popping & Braun 2007). 



Observational programs on the newer Arecibo L-band Feed Array (ALFA) system at 
Arecibo have followed a similar pattern with the largest surveys being devoted to large areas 
of sky. The Arecibo Legacy Fast ALFA Survey ( |Giovanelli et al.||2005[ |2007| ALFALFA) 
is covering 7000 deg 2 to a noise level of 2.2 mJy (at 10 kms -1 ) and the Arecibo Galaxy 



Environment Survey (Auld et al. 2006, AGES) is covering 200 deg 2 to a noise level of 0.75 
mJy (at 10 kins" 1 ). 



1.2. Hi observations above z=0.1 

Currently, the Cornell Hi digital archive (Springob et al. 2005), a homogeneous database 
of pointed Hi observations of about 9000 optically-selected galaxies, includes only a few 
objects with recessional velocities larger than 20,000 km s _1 . Even the ongoing ALFALFA 
and AGES surveys mentioned above only reach maximum recessional velocities of 18,000 
and 20,000 km s _1 , respectively. However, in spite of challenges such as limited sensitivity 
and radio-frequency interference (RFI), some progress in the detection of Hi emission from 
galaxies at redshifts beyond z ~ 0.1 has been made with deep integrations of galaxy clusters 
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with the Westerbork Synthesis Radio Telescope (WSRT) flZwaan et al. 2001; Verheijen et 



al. 2007), pointed Arecibo observations of Ha-emitting galaxies (Catinella et al. 2008), and 



'stacked' Giant Metrewave Radio Telescope (GMRT) integrations of fields with existing 
redshift information ( Lah et al.|2007 , 2009). The parameters of these surveys are summarized 
in Tab. [T] These observations have shown that it is possible to study the gaseous properties 
of galaxies in environments different to the local Universe, and at look-back times of up to 
2-4 Gyr. Nevertheless, selection effects, limited sensitivity, cosmic variance all conspire to 
make it challenging to obtain the accurate measurements required to quantify systematic 
evolutionary and environmental trends. 



1.3. Evolution of the Gas Content of the Universe 



The rate of conversion of gas into stars is one of the fundamental quantities that describe 
the evolution of galaxies and remains a difficult measurement to make for all but the most 
nearby galaxy populations (e.g. Madau et al.||1996 ; Haarsma et al.||2000 ; Bouwens et al.||2004 



Wong 2009). At redshifts between zero and unity, the global co-moving star- formation rate 



appears to increase by a factor of five (Hopkins 2004). Molecular gas content possibly also 



increases over this redshift range (Daddi et al. 2008). 



Early models of the evolution of cold gas predicted a constant conversion factor between 
gas supply and star formation and therefore led to a similar strong evolution of the cold 



gas content (Pei et al. 1999). Observations of local galaxies suggest that both hot and 



cold gas accretion significantly contribute to replenish the gas content of galaxies (see e.g. 



review by Sancisi et al. 2008), and the balance between the two might depend on redshift. 



The modern picture of a self-regulated equilibrium between the net inflow of gas and star 
formation leads to predictions of somewhat smaller evolution of the cold gas content at 
redshifts lower than two QObreschkow fc Rawlings||2009 ; Power et al.||2009 ). This appears to 
fit observations of damped Lyman-a systems (DLAs, e.g. Rao et al.|2005 ; Storrie-Lombardi& 
Wolfe||2000 ; Prochaska et al.|2004 , 2005[ ) which suggest that the redshift density of absorbers 



with N H > 2x 10 20 cm 2 increases only mildly, dN DL A/dz = 0.05(l+z) L1 , which is consistent 
with constant co-moving gas density. 

Conversion from DLA measurements to a cosmic gas density ^hi is unfortunately fraught 
with uncertainty, largely due to the small number of observed systems, but also due to 
uncertain systematic effects such as lensing and covering factor. Better measurements of 
the gas mass density from intermediate redshift DLAs are presently not possible because 



of the limited availability of space-based UV instruments. Rosenberg & Schneider (2003), 



Ryan- Weber et al. (2003), and Zwaan et al. (2005) compared local Hi observations with the 
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DLA number and argue that the number of galaxies found in their "blind" Hi surveys is 
consistent with the number densities of DLAs at redshifts out to z « 1.5. 

Unfortunately, direct measurement of the Hi in emission at high redshift is difficult. 
Bridging the gap between 21-cm surveys at z~ and DLA surveys at high z, for example, to 
measure the cosmic Hi mass density, requires deeper and more extensive surveys in the 21-cm 
line than currently available. To be useful, such measurements ideally need to (a) have better 
accuracy than existing measurements; (b) preferably not be tied to 'biased' optical samples 
or galaxy overdensities; (c) be over a large enough cosmic volume that cosmic variance is not 
a limitation; (d) be at a significant redshift or look-back time; and (e) sample the Hi mass 
function below the 'knee' where the bulk of the gas may may locked up. Unfortunately, 
achieving all of the above probably awaits the Square Kilometre Array (SKA). However, 
many of the above goals can be partially achieved with a deep survey of a small field with 
the ALFA instrument on the Arecibo telescope. Such a survey is currently in progress at 
Arecibo. This "Arecibo Ultra Deep Survey" (AUDS) is the deepest blind survey ever carried 
out with the Arecibo telescope, the world's largest single dish radio telescope. The full survey 
should allow the accurate measurements of the HI mass function up to the redshift limit of 
ALFA of z~ 0.16, corresponding to a look-back time of about 2 Gyrs or roughly 25% of the 
time between z~ 1 and z~ 0. In this paper, we report on a precursor project for AUDS 
which tested our ability to achieve the very low noise levels required by the full survey. 

2. Arecibo Ultra Deep Precursor Survey 
2.1. Motivation 

The goal of the AUDS is to carry out a blind 21-cm line survey at the highest redshift 
possible using the Arecibo L-Band Feed Array (ALFA). A fundamental requirement for the 
survey is the ability to survey a large volume to the highest practical redshift, which is 
given by the bandpass of the ALFA receiver and is about z~ 0.16. Sampling galaxies that 
contribute most of the gas mass to the Hi mass function at z^ requires sufficient sensitivity 
to detect some galaxies in the flat part of the luminosity function, i.e. galaxies down to at 
least 10 9 M . 

With a linewidth of 200 km s" 1 , a galaxy with M HI = 10 9 5 M has an integrated flux of 
0.03 Jy km s _1 and a mean flux density of only 140 /xJy at the far end of our redshift range. 
A sensitivity of 50 /iJy is necessary to detect such a galaxy with at least 3a significance per 
channel. AUDS will use a total of 1200 hours of observing time to survey a total area of 
1/3 square degrees with such a sensitivity. The integration time per pointing will be about 
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40 hours. This will be the longest integration time per pointing of any 21-cm survey ever 
conducted with the Arecibo telescope. 

Before committing this large amount of observing time, we carried out precursor obser- 
vations to test the feasibility of extremely deep surveys with the Arecibo telescope. These 
precursor observations were carried out from October 2004 to February 2005. The maximum 
redshift at which Hi line emission can be probed by AUDS is z= 0.16 and is set by the ALFA 
multibeam receiver, which operates in the 1225—1525 MHz frequency range. However, the 
lower end of such interval (i.e., frequencies below 1270 MHz, or z> 0.12) is heavily affected 
by RFI. Thus we carefully chose a suitable target field with previously measured optical 
redshifts that would guarantee a few Hi detections across the whole redshift range and that 
would allow us to test our algorithms of signal extraction in presence of strong RFI. Note 
that for this precursor survey an interim backend with a total bandwidth of 100 MHz was 
used. The redshift range from z = to 0.07, which will be sampled by the full survey, was 
therefore not accessible by the precursor survey. 



2.2. Mapping Strategy 

To reach the deepest possible observations with the multibeam array ALFA, we used a 
"drift and chase" mode to cover the survey region in a uniform manner. For that purpose, 
we rotated the feed array to an orientation so that a single position on the sky drifts through 
the three central beams. The basic strategy was to point 10 arcminutes ahead of the target 
field with the central beam, and then let the sky drift past for 100 seconds of time (or 
20 arcminutes). Subsequently, the telescope drove back to the starting point and the whole 
procedure was repeated. The feed array remained stationary during each drift, and rotated to 
track the sky only between drifts. This strategy is ideal for Arecibo because of its sensitivity 
to ground spillover. Keeping the orientation and therefore the spillover constant during each 
drift results in the best possible baselines. At the same time, the "on" and "off" source 
integration are as close as possible in time and allow removal of temporal changes in receiver 
gain and interference. 

An additional advantage of drift scanning is that positions and fluxes of sources can 
be measured at the full telescope resolution at least in the scanning direction. The goal of 
the full AUDS will be to obtain fully sampled 2-d maps. This is achieved by introducing 
offsets of about l/10th of the beam width between the individual drifts. However, to reach 
the highest possible sensitivity for the precursor observations, we repeated all our scans at 
exactly the same Declination. The resultant sky coverage therefore consists of three disjoint 
strips, each of length 1500 arcseconds. This footprint is shown in Fig. [TJ Only a small part 
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of the central strip is covered by three beams in each of the scans. 



2.3. Target Selection 

As explained in the previous section, the deepest part of the survey was a strip with the 
extension in Declination given by the size of the ALFA beams, and about 13 arcminutes long 
in Right Ascension. To make the precursor observations a useful test of our ability to detect 
galaxies, we have used the SDSS DR2 ( Abazajian et al.||2004 ) spectroscopic data base to 



identify a target field with the following characteristics: (a) observable from Arecibo during 
nighttime; (b) including at least one Hi-rich galaxy in the redshift range 0.09 < z < 0.12, 
corresponding to the relatively RFI-free 1270—1300 MHz frequency window; (c) including 
at least one Hi-rich galaxy at lower redshift; and (d) not contaminated by strong radio 
continuum sources. In practice, we generated a list of good candidates for Hi detection in 
the highest redshift end of our survey, and visually inspected the SDSS images, looking for 
presence of additional disk galaxies within a few arcminutes from the main target. The 
selection criteria for good Hi candidates are similar to those successfully used by |Catinella 



et al. (2008) to identify Hi-rich galaxies at intermediate redshifts (z> 0.16) detectable with 
Arecibo. Basically, we looked for relatively isolated, non-interacting galaxies with disk mor- 
phology, presence of Ha line emission, late-type spectra, and redshift as close as possible to 
our limit, z= 0.16. 

The field selected for our precursor observations is shown in Fig. [2} There are a total of 



31 galaxies with measured redshift listed in the SDSS DR7 (Abazajian et al. 2009). Two of 
them are in the foreground of our survey at z < 0.07, 17 of them within our redshift range 
0.07 < z < 0.15, and 14 in background at z > 0.15. There are more than one thousand 
additional SDSS galaxies in the field without optical redshifts. The field includes several 
late-type galaxies at different redshifts. Some of them are at or close to frequencies which 
are affected by RFI for a large fraction of the time. 



2.4. Observations and Data Reduction 

We used the Wideband Arecibo Pulsar Processors (WAPPs) with 100 MHz bandwidth 
and 4096 channels (3- level sampling). This resulted in a channel separation of 24.4 kHz (5.2 
kms -1 at z= 0). For this experiment, the WAPPs were tuned to a central frequency of 1275 
MHz, giving coverage over the range 1325 - 1225 MHz (corresponding to z= 0.07-0.16). 
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Fig. 1. — Footprint of the AUDS precursor survey. Dashed circles are the positions of the 
seven ALFA beams at the start of a scan, and dotted line at the end of a scan. The central 
of the three strips is covered by three beams, whereas the outer two strips are only covered 
by two beams. 
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We spent a total 53 one-hour sessions testing our "drift-and-chase" procedure on this 
target field. In the most sensitive part of the survey, the accumulated integration time per 
pointing was about 14 hours. The total telescope time spent to carry out these observations 
was about 50% higher than the actual time spent on the drift scans. A significant fraction 
of these overheads was due to time losses at the start of each observing session and before 
and after each drift scan. For the full AUDS survey, overheads are expected to be about 
25% since individual strips are longer. 

The measured rms in the spectra of the most sensitive part of the precursor survey is 
about 80 /xJy per 24.4 kHz channel throughout the bandpass, see Fig. [3j This is close to 
the expectations based on system temperature of 28K, gain of 9 K/Jy, and the accumulated 
integration time per pointing of 14 hours. In Fig. [4| we show the measured rms as a function 
of integration time at a frequency of 1270 MHz. It can be seen that the rms noise of the 
averaged spectra declined as a oc y/l/ t int down to the longest achieved integration time. 

Data reduction was performed using the Livedata and Gridzilla multibeam processing 
packages. Livedata performs bandpass estimation and removal, Doppler corrects the data, 



and calibrates the resulting spectrum (Barnes et al. 2001). The packages were modified by 



one of us (MC) to read Arecibo data, and new routines to reject RFI were implemented. 
Current versions of the Livedata/ Gridzilla packages include these routines and are available 
at |http : //w ww . atnf . csiro . au/computing/sof tware/livedata .html[ The bandpass was 
calculated independently for each drift, beam, and polarization. A clipped median was used 
to compute a reliable calibration factor for each spectral channel in the presence of strong 
and persistent RFI. 

After the bandpass calibration, data points with RFI were flagged, both with an auto- 
matic procedure and manually (see below). Finally, flagged calibrated data were combined 
into a data cube using Gridzilla. For the precursor experiment, the array configuration was 
optimized to provide depth rather than uniform coverage, with the beams forming three 
independent strips separated in Declination. In order to grid this efficiently we chose a 
non-isotropic grid-size, with pixels 1 arcminutes in Right Ascension and 5.2 arcminutes in 
Declination. This gave one row of pixels for each strip. The cube was formed by median- 
combining the data from the calibrated spectra, weighted so as to correctly reconstruct the 
flux of a source at the pixel center. The final data cube had a size of 45 pixels in Right 
Ascension, 3 pixels in Declination, and 4096 spectral channels. 



- 11 - 



100 



90 - 



80 - 



B 



70 - 



60 - 



50 




1220 1240 



1260 1280 
f [MHz] 



1300 1320 



Fig. 3. — Measured rms in unsmoothed spectra with exposure times longer than 13.5 hours. 
All measurements were performed in parts of the spectra without line emission or RFI. Solid 
squares are measurements on extracted galaxy spectra close to the detected Hi lines, whereas 
open squares are measurements at the center of the field. 
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Fig. 4. — Measured rms at 1370 MHz in unsmoothed spectra along the central strip of the 
survey as a function of the total accumulated integration time. The line is a fit to the data 

of a OC ^I/tint 
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2.5. RFI 

All precursor data are impacted to some extent by RFI at various frequencies. In about 
70% of all collected data, RFI was limited to well known frequencies as listed in the observa- 
tory RFI catalog. In about 30% of the data, RFI at unusual frequencies occurred, sometimes 
wiping out large fractions of the bandpass. Our strategy for removing RFI was twofold. First, 
we implemented an automatic RFI removal routine based on 3.5-sigma clipping of the data 
smoothed in time-frequency space. Subsequently, we carefully edited out data sets with RFI 
not removed by our automatic procedure. The fraction of data rejected because of RFI as a 
function of frequency is shown in Fig. [5j It can be seen that RFI appeared at all observed 
frequencies during the observations. About 6% of the band is severely affected by RFI with 
10% or more of the data points being flagged. This corresponds to about 8% of the sampled 
volume affected heavily by RFI. 

One concern for any deep survey are possible spurious detections of line emission caused 
by unrecognized RFI. Such a situation is most likely to happen close to spikes in the RFI 
spectrum. For example, the detected sources 2-4 and 2-6 are close to RFI spikes (see section 



3.1). We therefore re-examined this part of the spectrum and did not find any indication 
of undetected RFI. Later, we found that 2-6 was also detected in the SDSS at the exact 
redshift we determined from the Hi observations. This confirms our ability to detect lines 
even in spectral regions close to RFI spikes. 



3. Results 

3.1. Detection of Hi Emitters 

To search for Hi emission lines in the final data cube, we created grayscale plots of the 
flux density as a function of Right Ascension and frequency for each of the three strips. Each 
image was searched by eye for candidate Hi emitters. Examples of two such candidates are 
shown in Fig. [6} Individual scans for each candidate were carefully inspected to minimize 
the probability of including spurious RFI into the candidate list. We found 18 detected Hi 
lines with a peak signal-to-noise ratio in the unsmoothed spectrum of at least 3, and a total 
width of the line of at least 20 spectral channels. Since the channels are mostly independent, 
the total signal-to-noise ratio of these lines is at least about 2a x ^/20 w 9. 

We assigned a subjective quality code to each detected line, ranging from 1 for the best 
detections, to 3 for detections that can not be ruled out to be spurious. The criteria for 
the quality code were as follows. Isolated, high signal-to-noise ratio lines were assigned a 
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Fig. 5. — RFI occupancy versus frequency. 
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quality code 1. Any line spectra for which the uncertainties in the baseline were judged to 
be a significant fraction of the detected line were assignd a quality code of 3. Spectral lines 
in this category have low signal-to-noise ratios, large variations in the baseline and/or are 
uncomfortably close to RFI spikes. Spectra for which such an assignment was difficult to 
make were assigned the quality code 2. We suspect that several of the detected lines of the 
lowest quality are spurious, but note that the redshifts of two of them (AUDS 2-4 and 3-1) 
were later confirmed by optical redshift measurements in the SDSS. 

For each detected line source, we extracted a total spectrum by summing up all adjacent 
spectra where flux at the corresponding redshift was detected with at least 3a significance. 
The Right Ascension of the source was computed as the center of flux from the positions 
with line emission. The extracted spectra for all 18 detections sorted by increasing Right 
Ascension are shown in Fig. [7| Each detected source is named <s~n, where s is the number of 
the strip 1, 2 or 3, and n is a running number within the strip in order of increasing Right 
Ascension. Fig. [2] shows the distribution of the sources on the sky. Each source is marked 
by an ellipse which indicates the uncertainty in the position for high signal-to-noise sources. 
The positional uncertainty is discussed in more detail in Appendix A. 



3.2. Analysis of Extracted Spectra 



Frequencies were converted to velocities using standard relations (e.g. Greisen et al. 



2006), and Hi line parameters were estimated using the MBSPECT routine in the MIRIAD 



package (Sault et al. 1995) on the spectra after applying a 15 channel Hanning smoothing. 



Errors were estimated as in Koribalski et al. (2004). The measured parameters are listed 



in Tab. [2| Total Hi content was estimated from the integrated line flux using the standard 
relation 



Mi 



HI 



2.356 x 10 5 







d 2 

a lum 



dv 



1 + z Mpc 2 J Jy (km/sec) 
where d\ uin is the luminosity distance, and v is measured optical velocity. 



(i) 



The AUDS precursor geometry of three strips along constant Declination leads to a 
substantial positional uncertainty for the detected galaxies which will not be present in the 
better sampled full survey. The sensitivity between two strips is only a few percent compared 
to the center of a strip, and the strips are largely independent. We estimate the listed Right 
Ascension to be accurate to about l/5th the HPBW of the beam, i.e. about 40 arcseconds for 
high signal-to-noise detections in the central part of each strip. The position in Declination 
is much less accurate, galaxies with large Hi masses can be detected even if they are located 
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Fig. 6. — Examples of frequency - Right Ascension images used to search for HI emission. 
The two images are grayscale plots around a high signal-to-noise ratio source (AUDS 2-3, 
left) and low signal-to-noise ratio source (AUDS 2-4, right). The detected lines are encircled 
in each image. 
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Fig. 7. — Spectra of all eighteen Hi emission lines detected in the AUDS precursor survey. 
A three channel Hanning smoothing and subsequent 3 channel boxcar smoothing has been 
applied to each raw sepctrum. Regions where RFI was present more than 5% of the time 
are not plotted because of the high probability residuals from removed RFI. Each panel is 
labeled with the name of the source n-m and the quality code, which ranges from 1 for the 
best spectra to 3 for the lowest quality spectra. 
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between the strips. In Appendix A, the positional accuracy is discussed in more detail. Our 
inability to locate the galaxies precisely causes a bias in the mass estimate which does not 
exist in better sampled maps. Since the center of the beam is more sensitive than positions 
towards the edge, the positional uncertainty also leads to an uncertainty in the measured 
flux. Hi masses computed using the gain for the center of the beam are therefore in most 
cases an underestimate of the true mass. We estimated the bias from this effect by using 



a Gaussian approximation for the shape of the ALFA beams (llrwin et al. 2009) and the 



redshift of each detected source to compute the total Hi mass corresponding to the detected 
signal as a function of Declination, M m (5). We then used the Hi mass function $(M H i) 



by Zwaan et al. (2003) to derive the probability P(S) that the detected source is at that 



position, 



m = W2 (2) 

r/2' 

Finally, we computed the expected Hi mass as 



j:i 2 /2 HM m (5))d6 



M c ™ = J M m (8)P(S)dS (3) 

The corrected Hi masses range from 2 to 11 x 10 9 M , the detailed distribution is shown in 
Fig.® 



For six sources in our sample, a unique optical counterpart has been found (see Sec. 3.3). 
For those sources, the accuracy of the bias correction can be independently checked. One of 
the sources, AUDS 3-1, is at a position about 2.6 arcmin from the center of the strip. At 
this position, the sensitivity of individual beams is only about 10%, but the effective beam 
shape of the drift scans is highly uncertain. Our model predicts that the probability that the 
detected galaxy is located that far from the center of the beam is less than 1%. The other 
five optical counterparts are on average at a distance of 0.66 arcmin from the center of their 
strip, and the average flux correction relative to the center of the strip is a factor of 1.15. 
Our bias model predicts an average most likely distance from strip center of 0.65, and an 
average correction factor of 1.20. We therefore conclude that our flux correction significantly 
reduces the biases in the mass estimate based on the sensitivity at the center of each strip. 

The measured and derived Hi parameters are listed in Tab. [2} The first line for each 
galaxy lists the measurements, and the second line gives the la error estimates for each 
parameter. In detail, the individual columns contain the following: (1) assigned name in the 
form n-m, where n is the number of the Declination strip, and m is the running number 
assigned to each galaxy in order of increasing Right Ascension; (2) and (3) coordinates of 
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Fig. 8. — Mass distribution of Hi lines detected in the AUDS precursor run. The filled parts 
of the histogram are the highest quality spectra, the shaded part is the medium quality, and 
open parts are the lowest quality spectra. 
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the detected source; (4) measured noise in the unsmoothed spectrum close to the detected 
line; (5) peak signal to noise ratio of the unsmoothed line spectrum; (6) a quality code: 1 
- best detection; 2 - likely detection but possibly affected by RFI; 3 - uncertain detection 
because of low noise or RFI; (7) center frequency of line; (8) optical heliocentric velocity 
centroid, i.e. c • z, measured at 50% of peak flux; (9) redshift z; (10) luminosity distances 
using H Q = 72 km/sec/Mpc, Q m = 0.26, and Q\ = 0.74.; (11) observed velocity width of 
line measured at 20% of the peak flux; (12) integrated line flux; (13) raw Hi mass estimate; 
(14) Hi mass estimate corrected for position bias. For sources 1-2, 2-1, 2-4, 2-5, 2-6, and 3-1, 
column (14) lists the mass computed assuming the source position is identical to that of its 
SDSS counterpart. 



3.3. Redshift Distribution and SDSS Identifications 

The redshifts of the detected galaxies range from 0.07 to 0.15, see Fig. [9] for the redshift 
distribution. Nine of the detected galaxies plus six more candidates are at a redshift above 
0.1. This sample of at least nine Hi rich galaxies at z> 0.1 is one of the largest field samples 
of Hi sources at that redshift range. Fig. [2] shows the position and redshifts of the AUDS 
precursor sources, and also shows all spectroscopic sources with redshifts in the SDSS DR7 



(Abazajian et al. 2009). 



The six Hi detections labeled 1 — 1, 2 — 1, 2 — 4, 2 — 5 and 2 — 6 which overlap in position, 
and 3-1 all have SDSS counterparts that match both in position and redshift. In order to 
search for plausible candidate galaxies for detections without matching redshifts in the SDSS, 
we carefully searched the beam area around each detection for optical galaxies in the SDSS. 
The details of the search are given in Appendix A, where we present images obtained with 
the SDSS finding chart tool at http : //cas . sdss . org/dr7/en/tools/chart /chart . asp| for 
each source. Plausible candidates for the Hi emission are marked. At least one possible 
counterpart was found for each of the Hi detections. 

To investigate whether the sizes of the identified candidates makes them plausible 
sources for the Hi emission, we plotted the r-band isophotal major axes as listed in the 



SDSS as a function of redshift (Fig. 10) and estimated Hi mass (Fig. 11). For Hi detections 



without confirmed SDSS redshift, we used the major axis of the largest of the candidates. 



Fig. 10 shows that for all Hi detections there are candidates with sizes similar to other SDSS 



galaxies at that redshift. Fig. [TT] shows that most candidates with unknown optical redshift 
are smaller than the redshift confirmed SDSS counterpart. These smaller sizes are plausible 
given the smaller Hi masses of the unconfirmed candidates. We therefore conclude that for 
all our Hi detections, plausible counterparts exist in the SDSS. 
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Fig. 9. — Redshift distribution of Hi lines detected in the AUDS precursor run. The filled 
parts of the histogram are the highest quality spectra, the shaded part is the medium quality, 
and open parts are the lowest quality spectra. For comparison, the RFI occupancy shown 
in Fig. [5] is replotted in the lower panel. 
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Fig. 10. — Distribution of r-band isophotal major axes versus the redshift of Hi and SDSS 
sources within the target region. Open triangles are the Hi sources with SDSS counterparts 
confirmed by redshift, open squares those without such confirmation, and solid squares are 
SDSS galaxies in the AUDS precursor region. For sources without counterparts confirmed 
by redshift, the major axis of the largest plausible candidate in the field was used. 
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Fig. 11. — Scatter plot of the estimated Hi masses versus the r-band isophotal major axes. 
Open triangles are the Hi sources with SDSS counterparts confirmed by redshift, and open 
squares represent those without such confirmation. For sources without counterparts con- 
firmed by redshift, the major axis of the largest plausible candidate in the field was used to 
compute the restframe size. 
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3.4. Detection Rate 



The survey samples a rather small volume. The total volume within the area covered 
within the half-power sensitivity of at least one feed in each scan and between a redshift of 
0.07 and 0.16 excluding regions heavily affected by RFI is about 2000 Mpc 3 . The number of 
galaxies detected in such a volume will vary substantially for independent volumes simply 
due to cosmic variance. We estimated the expected cosmic variance of the number counts in 



this volume from the catalog of Croton et al. (2006). The Hi mass function recovered from 



this catalog resembles the one of Zwaan et al. (2003). We extracted one thousand mock 



surveys of galaxies with the same sensitivity and volume shape as the real data. We found 
that the la variation of the mean number of galaxies in the mock survey scatter is about 
50%. 

In addition, the selected volume is not a fair sample of that redshift range because it 
was not chosen at random. Nevertheless, it is illustrative to compare the detection rate of 
galaxies to that expected from the local Hi mass function. 

For that purpose, we computed the Hi mass function from our sample using the stan- 
dard l/^ m ax method (Schmidt 1968). Volumes were computed using the same Gaussian 



approximation for the shape of the beam as in Sec. |3.2[ For comparison, we also estimated 
the Hi mass function assuming a distribution function for the Hi flux and line width. The 
results were similar to the ones shown here. The result with the l/^ max method using the 



full sample is shown as filled squares in Fig. 12, The local Hi mass function determined 



from the HIPASS survey (Zwaan et al. 2005) is shown as a solid line. It can be seen that 



the shape of the derived mass function is consistent with the HIPASS Hi mass function. We 
used a weighted fit to determine that the relative normalizations between the two Hi mass 
functions differ by a factor of 3.2±1.0. We then recomputed the Hi mass function after re- 
moving detections with the lowest quality rating. The normalization of the derived Hi mass 
function from this reduced sample is still a factor of 2.1 ±0.7 higher than the local Hi mass 
function. We adopt the mean of the two factors and the mean relative error, 5hi = 2.6 ±0.8, 
as our best estimate of the relative HI overdensity in the AUDS precursor region and its 
uncertainty. 

For comparison, we estimated the relative density of SDSS galaxies within the AUDS 
precursor survey volume. For that purpose, we extracted the SDSS photometric redshifts 
for galaxies with r-band isophotal major axes between 5 and 40 arcsec. The selection on 
the major axes was imposed to eliminate galaxies with implausible photometric redshifts 
from the sample. The limits were chosen to include all candidate SDSS counterparts, see 
Sec. |3.3| and Fig. [10} The results presented below are insensitive to the exact range of 
selected isophotal major axes. There are 55 galaxies selected in that manner within the 



- 25 - 



AUDS precursor survey volume. 

We tested the success of our selection procedure for the subsample of galaxies with 
available spectroscopic redshifts. We found that all galaxies that are within the AUDS 
redshift range based on their spectroscopic redshift, are also correctly placed within that 
volume by their photometric redshift. There is only one galaxy which is placed within that 
volume by its photometric redshift even though the spectroscopic redshift is outside that 
range. The spectroscopic redshift of that galaxy is 0.1502. We therefore conclude that the 
SDSS photometric redshifts are accurate enough to estimate the relative densities of volumes 
in the AUDS redshift range. 

We then counted the number of galaxies selected in the same manner in 100 close-by 
regions of the same size and shape. We find that on average, there are 22.3 ± 1.3 galaxies 
in regions of that size, where 1.3 is the uncertainty of the mean based on the scatter in the 
number counts. The overdensity of SDSS galaxies in the AUDS precursor region is therefore 
about 5sdss = 2.5 ± 0.4, where the error estimate includes the Poisson error on the number 
counts within the AUDS volume. Assuming that the ratio between Hi density and that of 
SDSS galaxies is independent of the local overdensity, we can estimate the mean Hi density 
at median redshift of the AUDS precursor survey in terms of the Hi density at zero redshift 
Po to be 

pm(z = 0.125) = -^p = (1.0 ± 0.3) • p . (4) 

^SDSS 

The error estimate includes only random errors. When we use the SDSS spectroscopic 
redshifts to estimate the relative overdensity in the same manner as above, the result is 
Phi(^ = 0.125) = (1.3 ± 0.6) • po- The larger uncertainty is due to the smaller number of 
galaxies with measured spectroscopic redshift. 

4. Summary and Conclusion 

We have used about 53 hours of integration time with the Arecibo telescope to demon- 
strate the feasibility of high sensitivity Hi observations with the Arecibo L-band Feed Array. 
To this depth, the noise in the final spectra still behaves as oc ^/l/i exp . RFI renders a 
significant fraction of the spectrum useless. However, regions of the spectrum where RFI is 
rarely present are little affected by the strong RFI in neighboring channels. 

In total, we detected 18 Hi emitters in the survey. For six of them, there are optical 
counterparts with measured redshifts in the SDSS. For all other detections, there are plausible 
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Fig. 12. — Hi mass function of detected Hi emitters. Filled squares are the points derived 
using all detected Hi emitters, whereas the open squares only use the highest quality detec- 
tion. The solid line is the local Hi mass function by Zwaan et al. ( 2005[ ). The dashed line 
is the local HI mass function multiplied by the relative overdensity of SDSS galaxies within 
the survey region (see text). The arrow marks the resulting density if a single galaxy had 
been detected in that bin. 
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SDSS counterparts without redshift confirmation. The detection rate is consistent with the 
total Hi density in the local universe. 

The experiment was a pilot study for the "Arecibo Ultradeep Survey" . This survey will 
be more sensitive, cover more area on the sky with a better sampling, and probe a larger 
redshift range than this pilot study. AUDS will allow to investigate in detail the Hi content 
in two independent volumes outside the local Universe. 

This research has made use of the Sloan Digital Sky Survey archive. Its full acknowl- 
edgment can be found at |http : //www . sdss . org[ 
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A. Appendix: SDSS Counterparts of AUDS Hi Detections 

The target field for the AUDS precursor survey was chosen to include galaxies that 
were likely to be strong Hi emitters as judged by the SDSS spectrum. Those galaxies were 
intentionally placed close to the center of the central strip in the most sensitive part of the 
survey. The Hi counterparts for these galaxies can easily be identified by redshift. The final 
data cube of the survey was searched for Hi detections without knowledge of the location or 
redshift of SDSS galaxies in this volume. Most of the Hi detections are lower in Hi content 
than the preselected targets, and most of them are without matching redshift listed in the 
SDSS. Identifying the optical counterparts for those sources is difficult because of the large 
beam size. 

Because of the geometry of the survey, the Declination of a source is very uncertain. The 
sensitivity of the telescope beam between the strips drops to about 10% of the sensitivity 
in the center of the strips. A strong source can therefore be detected at any position within 
the survey region, including the region between the strips. If a source is located at a large 
angular distance from the center of the strip, the measurement error in Right Ascension 
becomes dominated by the lack of our knowledge of the exact beam shape. The ALFA beam 
shape at larger distance form the center varies strongly with the pointing of the telescope 



(Irwin et al. 2009), and the effective beam shape of our repeated drifts is highly uncertain. 
An accurate estimate of the positional uncertainties as a function of signal-to-noise ratio 
is therefore difficult. For simplicity, we adopt 20% of the FWHM beam as our estimated 
uncertainty in Right Ascension for all of our sources. 

We searched for optical counterparts within 3.2 arcminutes of the Hi position in Right 
Ascension and 6 arcminutes in Declination. This corresponds to a sensitivity of 50% and 
10% of that at the center of the beam. Within that region, we selected possible optical 
counterparts from the SDSS that were classified as galaxies and had a major axis of 10 
arcseconds or more in the SDSS r-band. We manually cleaned the extracted source list by 
inspecting their images and removing misidentification such as double stars or image defects. 
If the SDSS redshift of any of the candidates was within the detected Hi line, we adopted 
this source as certain detection of a unique optical counterpart. 

In Fig. Al, we show a region of 6x6 arcmin 2 around each AUDS source obtained from 
the SDSS finding chart generator. For better clarity, the images are shown with inverted 
color scale, i.e. bluer colors represent redder galaxies. The sensitivity at the edges of the 
images is approximately 10% of the sensitivity in the center. In each image, the approximate 
size of the beam at 50% and 25% of the beam center is marked as circles. The outer part 
of the shown region corresponds to about 10% the sensitivity of the center of the beam. For 
sources with a unique optical counterpart, the counterpart galaxy is circled. For all other 
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sources, all possible counterparts are marked with circles. The diameter of each circle is the 
size of the major axis of each source as listed in the SDSS. For each source, we selected up to 
four candidates we judged to be the most likely optical counterparts based on the following 
criteria: 1) counterparts are more likely to be located close to the center of the field, 2) Hi 
line widths larger than 400 kms -1 are likely to correspond to galaxies with highly inclined 
disks, whereas those smaller than 200 kms -1 should correspond to almost face-on disks, and 
3) larger sources are more likely to contain large amounts of Hi. The most likely candidate 
counterpart galaxies are also shown separately in cutouts 20 x 20 arcseconds in size, and 
their major axis a is given. For easy comparison, we also reproduce the Hi spectrum for each 
source. In this spectrum, the measure Hi redshift is marked by an arrow at the top, and if 
available, the SDSS redshift with error estimated is marked by a bar below the Hi line. 
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Fig. Al. - SDSS counterparts. 
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The narrow line width of 197 kms -1 suggests a relatively face-on galaxy. Because of the 
low signal-to-noise ratio, we cannot exclude a substantially larger line width. The Hi 
line is also visible in the spectrum of source 1-2, which is consistent with the position 
of SDSS 235914.9+153457. Source 1-2 is within the beam, and its line is visible in the 
spectrum between 1275 and 1279 MHz. 
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The redshift of SDSS 255918.8+153641 is marked in the Hi spectrum. Source 1-1 is also 
in the beam and its line is visible at around 1272 MHz. 
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The redshift of SDSS 235936.4+155029 is marked in the spectrum . It is located close 
to the edge of the image cutout, where the sensitivity is about 10% of the sensitivity at 
the center of the beam. The relatively low Hi mass estimate of 1.7 x 1O 9 M is consistent 
with such an off-center position. 
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Fig. Al. - continued 
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SDSS 235948.4+154243 was one of the sources used to select the target region. Its SDSS 
redshift is marked in the spectrum. 
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Hi emission is between regions with frequent RFI. Both candidate SDSS counterparts 
are low surface brightness galaxies. 



2 — 3 q=l z = 23408 km/sec 



0.5 



- J 

: f 


1 1 i 
At 




~~ , , , i , , , , 1 





1315 



1320 



® 



1: 235960.0+154235 

a=23" 
2: 235955.4+154301 

a=12" 



The line width of 277 kms 1 suggests a highly inclined galaxy. This makes SDSS 
235960.0+154235 the most likely optical counterpart. 
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Fig. Al. - continued 
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The narrow line width suggest a relatively face-on galaxy, whereas the SDSS counterparts 
are fairly inclined disks. Because of the low signal-to- noise ratio, the line width could be 
underestimated. The beam overlaps with the one of source 1-4, and they share two of 
the candidate counterparts. 
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A cluster at z~ 0.152 to the north of the beam was not detected. Only one of the cluster 
members has a measured optical redshift. It is therefore possible that one of the apparent 
cluster members is the optical counterpart of the Hi emission. 
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The narrow line width suggest a relatively face-on galaxy, whereas the SDSS counter- 
parts are fairly inclined disks. The beam overlaps with the one of source 1-3. 



Fig. Al. - continued 
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Because of the low signal-to-noise ratio and the closeness of a region with frequent RFI, 
the quality of the spectrum was judged to be low. However, the exact match in redshift 
confirms the reality of the detected line. The redshift of SDSS 000011.1+154212 is 
marked in the spectrum. 



2-5 



q=l z = 33533 km/sec 



0.8 
0.6 
0.4 
0.2 



or 



0.2 





1275 



1280 



• 



000016.8+154140 
a=22" 
z= 0.1118 

SDSS 000016.8+154140 was one of the sources used to select the target region. Its SDSS 
redshift is marked in the spectrum. The measured position of source 2-5 is identical to 
source 2-6. 
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SDSS 000014.9+154342 was one of the sources used to select the target region. Its SDSS 
redshift is marked in the spectrum. The measured position of source 2-6 is identical to 
source 2-5. The line is close to a region in the spectrum with well known RFI at Arecibo. 
We used this fact to test our ability to detect lines close to RFI. 
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Fig. Al. continued 
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Because of its high signal-to- noise ratio, the positional uncertainty in right ascension 
should be small. Another potential counterpart is partially hidden by the bright star 
close to 000021.8+153909. It is not marked because the SDSS measurement of the major 
axis is not reliable. 
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The edge-on disk galaxy in the center of this beam is at a redshift of 0.115 and was not 
detected. 
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SDSS 000039.0+154551 is probably too far off center to be the optical counterpart given 
the high signal-to-noise ratio of the detected Hi line. The peaks of the double-horned 
profile of source 3-4 are visible in the spectrum at ~1274 and 1275.5 MHz. 
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Fig. Al. - continued 
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The beam overlaps with the ones around sources 3-5 and 3-3. The second line in the 
spectrum centered at ~1277 MHz originates from source 3-3. 
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The high line width of 593 kms 1 suggests a highly inclined galaxy. For that reason, 
000059.3+154034 is the most likely optical counterpart. 
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